Modeling segment intonation for Slovene TTS system
نویسنده
چکیده
A scheme for modeling the F0 contour for different types of intonation units for the Slovene language is presented. It is based on results of analyzing F0 contours, using a quantitative model. Data from ten speakers was collected, resulting in a large corpora, mainly of declarative sentences. A way of generating the F0 contour for given utterances was defined, using only the text of the utterance as input. Near-to-natural synthesized F0 contour was obtained by rules which regard the F0 contour as the sum of global and local components.
منابع مشابه
Maximum-likelihood dynamic intonation model for concatenative text-to-speech system
In this work we present a Maximum Likelihood (ML) joint pitch curve modeling, inspired by HMM TTS synthesis concept. This model provides an optimal solution for the coarse target intonation curve (3 points per syllable) and incorporates both static and dynamic pitch values for better utterance intonation modeling. The coarse intonation curve may be optionally combined with the original pitch ex...
متن کاملModeling of intonation bearing emphasis for TTS-synthesis of greek dialogues
TTS-synthesis of neutral style Greek with good intelligibility and quality has been achieved some time ago. As a further step towards expanding the applications domain of the TTS-system developed in our laboratory, the incorporation of emphasis into speech used in man-machine dialogues according to their context has been studied recently. In this paper the method applied for the analysis of int...
متن کاملComparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis
Chironomic stylization is the process of real-time modification of intonation contours (f0 and tempo) using drawing/writing gestures with a stylus on a graphic tablet. The question addressed in this research is whether hand-made intonation stylization could improve or degrade expressivity and overall quality, compared to statistical modeling of prosody. A system for expressive TTS in French bas...
متن کاملTowards an intonation module for a portuguese TTS system
In this paper, a correlation between the linguistic structure of the written text and the real intonation behavior of the read speech in European Portuguese language (EP) is presented. It is our belief that intonation behavior in EP can be strongly predicted from two main coordinates: the syntactic structure of the sentence and its pragmatic communicative function, in one way, combined with the...
متن کاملExploring the naturalness of several German high-quality-text-to-speech systems
The synthesis of near-to-natural F0 contours is an important issue in text-to-speech and crucial to the naturalness and intelligibility of synthetic speech. In earlier studies of the first author a model of German intonation was developed that is based on the quantitative Fujisaki-model. The current paper addresses a perception experiment comparing a TTS-system incorporating this new approach w...
متن کامل